|
|
Accession Number |
TCMCG021C10556 |
gbkey |
CDS |
Protein Id |
XP_010918844.1 |
Location |
complement(join(34142857..34143087,34143162..34143875,34144427..34144723,34147388..34147445,34149171..34149314,34149475..34149731,34155622..34155828,34158716..34158889,34173009..34173076,34187758..34187816,34198532..34198607,34200162..34200267,34201440..34201549,34214849..34215013,34222260..34222345,34223483..34223547,34232032..34232079,34232791..34232910,34236441..34236516,34238358..34238455,34238549..34238681,34244077..34244189)) |
Gene |
LOC105043124 |
GeneID |
105043124 |
Organism |
Elaeis guineensis |
|
|
Length |
1134aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA268357 |
db_source |
XM_010920542.2
|
Definition |
DNA mismatch repair protein MSH1, mitochondrial isoform X2 [Elaeis guineensis] |
CDS: ATGCACCGTGTGGTAACCAGCTCCCTCGTGGCGTCCTCACCTCGATGGCTCTCACTCGCGGGATTTCTCCGATCCTTCGCCATCCGGAGGCTCTACAAATCGCCGTTTCCAAGGCTTGTAGAAAGAAAATATTGTTCTAAATCACATGAAATTCTGGTCAGAGTACCTAAGGTCTCTGGAAGATTAAAACAGTCAAAAATTCTTTATGAGGTAGACCATCAGTCTCACATTTTGTGGTGGAAAGAGAAAATGCAGACGTGCAAAAAGCCTTCTTCAGTTCAGCTGATTAAGAGGCTCAAGTATACAAATTTATTAGGATTGGATGTTAGCCTTAGAAATGGAAGCTTAAAAGAAGGAACTCTCAACATGGAGTTATTGCAATTTAAATCGAAGTTTCCCCGTGAAGTTTTACTATGTAGAGTTGGAGATTTCTATGAAGCAATTGGATTTGATGCTTGTGTTCTTGTTGAGCATGCTGGTTTAAATCCTTTTGGGGGGTTGCGGTCAGATAGTATTCCAAGGGCCGGCTGCCCTGTTGTGAACTTGCGCCAAACTTTGGATGACTTGACTCGAAATGGGTTTTCTGTTTGCATAGTTGAGGAGGTTCAGAGCCCAACCCAGGCTCGTTCTCGTAAAAATCGATTTATATCTGGCCATGCACATCCAGGTAGCCCTTATGTATTTGGGCTTGCTGGGGTTGACCATGATGTTGAGTTTCCTGATCCAATGCCTGTAGTTGGGATCTCACATTCTGCAAAAGGATATTGCATGGTCTCAGTCCTAGAAACCATGAAAACATTTTCGTCAGAAGATGGCCTTACAGAAGAAGCAATAGTTACCAAGCTCCGCACATGCCGGTATCACCATTTATATCTGCACACTTCTTTGAGACAAAATTCTTCAGGTACTTGTCGCTGGGGAGAATTTGGTGAGGGGGGACTTTTGTGGGGAGAATGCAATGGAAAGCCCTTTGACTGGTTTAATGGTGATCCTGTCGAAGAGCTTCTATGCAAGGTAAGAGAGATATATGGTGTTGACCAAGAAACCACATTTCGGAATGTTACTATATATTCAGAGAGAAGGCCTCAACCTTTGTATCTTGGAACTGCAACTCAAATAGGAGTCTTACCAACCGAGGGAATTCCTAGCTTGTTGAAGGTTTTGCTTCCTGCAAACTGTGTTGGTCTTCCAATATTGTATATTCGAGATCTTCTTCTTAATCCTCCCACTTATGAGACTGCTTCGGCAATTCAAGAGACATGCAGGCTTATGAGCAATGTAACTTCTTCAATCCCTGAGTTTACTTGCATGTCAGCACCAAAGCTTGTGAAATTGCTCGAGTCAAAGGAGGTAAATCATGTAGAGTTCTGTAGAATAAAGAATGTAGTTGATGAAATTCTGCAGATGAGTAGAAGCACTGAGCTTGCTACAATCCTACATATACTGTTAGAACCAACTTGGGTAGCAACTGGACTGAAAGTTGAATATGATAGACTGGTGAATGAATGCAGTTTGGTTTCAAAAAGGATAGGTGAAATAATCTCCTTGGGTGGTGAAAGCGATCAGGAGATTAGTTCATTTGAATGCATTCCTAGGGAGTTCTTTGAGGATATGGAATCATCATGGAAAGGCCGTGTGAAGAGGATCCATGCAGAGGAGGCATTTGCAGAAGTGGAGAGGGCTGCCAAGGCCTTATCTGTTGCAGTTATGGAAGATTTGTTTCCAATTGTTTCAAGAGTCAAGTCTGTTGTCTCTTCTCTTGGAGGTCCAAAGGGGGAAATATGTTATGCAAGAGAGCATGAAGCTGTTTGGTTTAAAGGTAAGCGTTTCATGCCAGCTGTGTGGGCTAACACCCCTGGGGAAGAACAAATCAAGCAACTGAGACCTGCTATGGATTCAAAAGGGAGAAAGGTTGGAGAGGAATGGTTTACCACAATAAAAATTGAGGGTGCTCTAAACAGGTATCATGAAGCCAGTGATAAGGCAAAGAATAAAGTTTTGGAGTTATTAAGAGGACTTTCTGGTGAATTGCAGACAAATGCTAACATTCTTGTTTTCTCTTCCATGTTGCTTGTAATAGCGAAGGCACTTTTTGGTCATGTTAGTGAAGGCCGAAGAAGGGAATGGGTGTTTCCTAAGCTCAAGGAGTTTCACAGTCCTGAGGATAAGATAGCAGGAAACACTATCAAAATGGAGTTATCAGGATTATCTCCTTACTGGTTTGATGCGGCACAAGGCAATGCCATACAGAACACTGTTAAAATGCACTCGCTATTCCTTCTGACTGGGCCAAATGGTGGTGGTAAATCTAGTTTGCTTCGATCAATTTGTGCTGCTGCATTGCTTGGAATTTGTGGGCTTATGGTGCCTGCTGAGTCAGCTGTCATTCCTGATTTGGATTCTGTTATGCTGCACATGAAAGCTTATGATAGTCCTGCTGATGGGAAAAGTTCATTTCAGATTGAGATGTCAGAAATGCGCTCCATAATCACTAGAGCTACCCCAAGGAGCTTAGTTCTTGTGGATGAAATCTGTAGAGGCACAGAAACTGCTAAAGGAACCTGTATTGCTGGTAGCATTGTTGAGATGCTTGATTGCACTGGCTGCCTGGGCATCGTATCAACCCATTTGCATGGCATTTTTGACTTGCCTTTAGCCACAAAAAATACCGTCCACAAAGCAATGGGAACAGAGGTTGCAGATGGCCGCATAAGACCAACATGGAAGTTGATAGATGGAGTGTGTAGAGAGAGTCTTGCCTTTGAAACTGCCCAGAAGGAAGGCATTCCCGAAAAAATCATTCAAAGAGCCGAAGAGCTATATCTCTCAATGAATGTGACTGATTCACGCATTGCTCCAAATTCTACAAAAGCTGAGCATTTCAATGCAAAGTCTAATGCAAGGGGTCTTGGTGAAATCTGTGATTCTTCAAGGACTAGTTTAGATTTTCTTCCTTCTGGCAACTTGGAACTATCACAGAAGGAAGTTGAGAGTGCGGTTACCATAATCTGCCAGAAGAAGTTGATAGAGCTTTACAAGAAGAAAAGCATATCTGAGCTTGCAGAGGTGATGTGTGTTGCAGTTGGTGCTAGGGAGCAGCCTCCGCCCTCTAGCGTGGGCACTTCCTGCATCTATGTACTCTTCAGGCCTGACAAGAAATTATATGTTGGACAGACGGATGATCTAGTGGGCCGAGTTCGTGCTCATCGTTCCAAGGAAGGCATGCAAAATGCGGTGTTCCTATATGTTATAGTACCAGGAAAGAGCATTGCGAGTCAACTTGAGACCCTTCTCGTCAACCAGCTCCCCCTTCGAGGTTTCAGGCTTGTCAACAAAGCTGATGGTAAGCATCGTAATTTTGGCACATCTAGACTCCCCATAGAAGCCATTACGTTGCACCAATGA |
Protein: MHRVVTSSLVASSPRWLSLAGFLRSFAIRRLYKSPFPRLVERKYCSKSHEILVRVPKVSGRLKQSKILYEVDHQSHILWWKEKMQTCKKPSSVQLIKRLKYTNLLGLDVSLRNGSLKEGTLNMELLQFKSKFPREVLLCRVGDFYEAIGFDACVLVEHAGLNPFGGLRSDSIPRAGCPVVNLRQTLDDLTRNGFSVCIVEEVQSPTQARSRKNRFISGHAHPGSPYVFGLAGVDHDVEFPDPMPVVGISHSAKGYCMVSVLETMKTFSSEDGLTEEAIVTKLRTCRYHHLYLHTSLRQNSSGTCRWGEFGEGGLLWGECNGKPFDWFNGDPVEELLCKVREIYGVDQETTFRNVTIYSERRPQPLYLGTATQIGVLPTEGIPSLLKVLLPANCVGLPILYIRDLLLNPPTYETASAIQETCRLMSNVTSSIPEFTCMSAPKLVKLLESKEVNHVEFCRIKNVVDEILQMSRSTELATILHILLEPTWVATGLKVEYDRLVNECSLVSKRIGEIISLGGESDQEISSFECIPREFFEDMESSWKGRVKRIHAEEAFAEVERAAKALSVAVMEDLFPIVSRVKSVVSSLGGPKGEICYAREHEAVWFKGKRFMPAVWANTPGEEQIKQLRPAMDSKGRKVGEEWFTTIKIEGALNRYHEASDKAKNKVLELLRGLSGELQTNANILVFSSMLLVIAKALFGHVSEGRRREWVFPKLKEFHSPEDKIAGNTIKMELSGLSPYWFDAAQGNAIQNTVKMHSLFLLTGPNGGGKSSLLRSICAAALLGICGLMVPAESAVIPDLDSVMLHMKAYDSPADGKSSFQIEMSEMRSIITRATPRSLVLVDEICRGTETAKGTCIAGSIVEMLDCTGCLGIVSTHLHGIFDLPLATKNTVHKAMGTEVADGRIRPTWKLIDGVCRESLAFETAQKEGIPEKIIQRAEELYLSMNVTDSRIAPNSTKAEHFNAKSNARGLGEICDSSRTSLDFLPSGNLELSQKEVESAVTIICQKKLIELYKKKSISELAEVMCVAVGAREQPPPSSVGTSCIYVLFRPDKKLYVGQTDDLVGRVRAHRSKEGMQNAVFLYVIVPGKSIASQLETLLVNQLPLRGFRLVNKADGKHRNFGTSRLPIEAITLHQ |